TITLE OF THE INVENTION 

IMAGE READING APPARATUS 



BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

[0001] This invention relates to an image reading apparatus, and more 
particularly to an image reading apparatus to read an image which contains 
character information and to easily determine the language of the original based 
from the characteristic of the image information of the original. 

2. Description of the Related Art 

[0002] When characters are recognized in the image reading apparatus using an 
OCR (optical character reader), etc., the OCR engine is necessarily used for each 
language of the characters contained in the original to be read. To attain this, the 
language of the character in the original to be read is manually set in advance by 
the user of the apparatus before a read. Therefore, the user has the problem that 
the apparatus is not easily operable. 

[0003] To solve the problem, an apparatus for automatically determining a 
language has been suggested by loading OCR engines for a plurality of languages 
on the OCR, actually recognizing the characters in an original using the OCR 
engine for each language, selecting the language having the highest probability of 
correct determination (for example, patent document #1: Japanese Patent 
Application Laid-Open No. 6-150061). 

[0004] This is as shown in FIG. 7. That is, an image input device 50 reads a 
character written in an original as image data, and transmits the data to a 

1 



character recognition process unit 51. The character recognition process unit 51 
has a plurality of OCR engines for a plurality of languages for character 
recognition. For example, as shown in FIG. 7, a Japanese OCR engine 52 
performs a pattern matching process between the received image data and the 
character pattern in a Japanese character pattern dictionary 53, thereby executing 
character recognition. Then, an English OCR engine 54 performs a pattern 
matching process between the same image data and the character pattern in an 
English character pattern dictionary 55 to recognize characters. From the result 
of the character recognition by the OCR engine of each language, the probability of 
correct determination indicating the probability of correct recognition is obtained, 
and is transmitted to a language determining process unit 56. Based on the 
probability of correct recognition transmitted from the OCR engines for a plurality 
of languages, the language determining process unit 56 determines the language of 
the highest probability of correct recognition as the language of the characters 
contained in the original. 

[0005] In addition, to prevent wrong determination, a similar determining 
process is performed on a plurality of characters contained in the original, and a 
statistical process is also performed to determine the language of the highest 
probability of the language of the characters contained in the original as the 
language of the characters contained in the original. 
[0006] However, in such apparatus, since character recognition and 
determination are repeatedly performed by the a plurality of OCR engines for a 
plurality of languages, the process requires a long processing time. 
[0007] Furthermore, the above-mentioned language determining process is to be 
performed as a preprocess for character recognition. Then, it is desired to realize 
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the function by hardware to complete the process within a shortest possible time. 
However, it is very hard to realize the function of the plurality of OCR engines for 
a plurality of languages and the character pattern dictionaries for the languages 
by hardware. 

[0008] As described above, the conventional technology has the following 
problem. That is, when an image reading apparatus recognizes a character using 
an OCR, etc., it is necessary to use an OCR engine specialized for the language to 
be recognized or used in an original. Therefore, a user manually sets the 
language of the original when an image is read, so that it is troublesome for the 
user to operate such apparatus. 

[0009] To solve the problem, there has been an apparatus suggested for 
automatically determining the language of the characters contained in an original 
by loading a plurality of OCR engines for a plurality of languages, performing 
character recognition by the OCR engines for the respective languages, and 
selecting a language of the highest probability of correct determination, thereby 
automatically determining the language of the characters contained in the 
original. 

[0010] However, the apparatus has to actually perform character recognition by 
the plurality of OCR engines for the plurality of languages each time an original is 
read, and requires a long processing time. Furthermore, to shorten the 
processing time, it is preferable to realize the function by hardware, but it is hard 
to realize the OCR capability by hardware. 



SUMMARY OF THE INVENTION 
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[0011] It is an object of the present invention to provide an image reading 
apparatus capable of easily and automatically determining a language of the 
characters contained in an original without performing character recognition using 
an OCR, etc. 

[0012] To solve the above-mentioned problem, an image reading apparatus of 
the present invention includes labeling process unit, overlap integrating process 
unit and language determining process unit. The labeling process unit groups a 
continuous black pixel area from binarized monochrome image, and extracts group 
bounding rectangle information containing the grouped continuous black pixel 
area for each group. The overlap integrating process unit determines the overlap 
between the group bounding rectangles contained in the row rectangles, and 
integrates the overlapping group bounding rectangles into one group bounding 
rectangle. The language determining process unit obtains a ratio of the number 
of group bounding rectangles contained in the row rectangle before performing an 
overlap integrating process to the number of group bounding rectangles contained 
in the row rectangle after performing the overlap integrating process, and 
determines a language based on the ratio. 

[0013] Preferably the image reading apparatus of the present invention further 
includes row extracting process unit which analogizes a sequence of characters in 
the original from the position relationship of the group bounding rectangle, detects 
a row, and extracts row rectangle information when graphics and pictures are 
contained in a document to be read. 

[0014] Preferably the image reading apparatus of the present invention further 
includes binarizing process unit which converts image data input through an 
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image input device such as a CCD into binary image data of two gray levels of 
black and white when the input image data is multi-valued image data. 
[0015] Preferably the image reading apparatus of the present invention further 
includes statistical determination process unit which performs the 
above-mentioned determining process on all rows contained in the original, 
performs a statistical process so that the language determined to be the language 
of the characters contained in the most rows can be selected as the language of the 
characters contained in the original. 

[0016] When a document to be read contains only characters, the labeling 
process, the overlap integrating process, and the language determining process can 
be performed on the entire original. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0017] FIG. 1 shows the configuration of the present invention. 

FIGS. 2A and 2B are an explanatory view of the labeling process. 

FIG. 3 is an explanatory view of the case in which group bounding 
rectangles are linearly arranged in the X direction. 

FIG. 4 is an explanatory view of the case in which group bounding 
rectangles are linearly arranged in the Y direction. 

FIGS. 5A, 5B and 5C are an explanatory view of the overlap integrating 
process. 

FIGS. 6A and 6B is an explanatory view of the process of obtaining an 
overlap integration ratio. 

FIG. 7 is an explanatory view of the conventional determining process of the 
language of the characters contained in an original. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
[0018] The present invention is embodied as follows. The image reading 
apparatus of the present invention has a binarizing process unit. When the 
image data input from an image input device such as a CCD is multi-valued image 
data, the binarizing process unit converts the image data into monochrome binary 
image data or the data of two levels of black and white. Thus, although the image 
data read by an image input device is the data of a color image or a multilevel gray 
scale image, the subsequent image processing such as the labeling process, etc. can 
be simplified. 

[0019] The image reading apparatus has a labeling process unit which groups a 
continuous black pixel area from binarized monochrome image by determining a 
sequence of black pixels, and extracts group bounding rectangle information 
containing the grouped continuous black pixel area for each group. Thus, a 
continuous area of character components of lines, dot, etc. forming a character can 
be grouped, and group bounding rectangle information can be obtained for each 
group. 

[0020] The image reading apparatus has a row extracting process unit. When 
graphics and pictures are contained in a document to be read, the row extracting 
process unit extracts row rectangle information by determining the position 
relationship of each group bounding rectangle based on the group bounding 
rectangle information extracted by the labeling process unit. This prevents an 
occurrence of wrong determination by determining a language by processing the 
graphics and pictures contained in a document to be read as well as the characters. 
Thus, only the characters are processed when determining a language. 
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[0021] The image reading apparatus has a overlap integrating process unit 
which determines the overlap between the group bounding rectangles contained in 
the row rectangles, and integrates the overlapping group bounding rectangles into 
one group bounding rectangle when the overlap between the group bounding 
rectangles is detected, thereby extracting a group bounding rectangle. Some 
languages such as Japanese has the feature that lines and dots forming single 
character are separated each other. Thus, a larger number of groups can be 
integrated for Japanese than for English, etc. by the overlap integrating process 
unit. This integrating process realizes the effect of extracting the feature of a 
language for identification of a language. 

[0022] The image reading apparatus has a language determining process unit 
which obtains a ratio of the number of group bounding rectangles contained in the 
row rectangle before performing an overlap integrating process to the number of 
group bounding rectangles contained in the row rectangle after performing the 
overlap integrating process, and determines a language of the characters 
contained in the row based on the overlap integration ratio. Thus, a language can 
be determined based on the differences in overlap integration ratio among 
languages. 

[0023] The image reading apparatus has a statistical determination process unit 
which performs the above-mentioned determining process on all rows contained in 
the original, performs a statistical process so that the language determined to be 
the language of the characters contained in the most rows can be selected as the 
language of the characters contained in the original. Thus, although wrong 
determination occurs depending on the character data in the row contained in an 
original, a final mistake in automatic determination of a language can be avoided 
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by performing the language determining process on a plurality of rows, thereby 
successfully performing the automatically determination of a language with a 
higher probability of correct determination. 

[0024] When a document to be read contains only characters, the labeling 
process, the overlap integrating process, and the language determining process can 
be performed on the entire original. 

[0025] Described below are the typical embodiments of the present invention. 
In the following explanation, the same component is assigned the same reference 
numeral, and the detailed explanation can be omitted for suppression of 
overlapping descriptions. 

[0026] The present image reading apparatus can automatically determine a 
language of the characters contained in the document image data read by an image 
input device such as a CCD, etc. 

[0027] As shown in FIG. 1, the image reading apparatus has an image input 
device 1 such as a CCD, etc., and reads an image of an original as electronic data. 
The image input device 1 may read or input a color or multilevel gray scale image. 
In this case, the read image data is represented by multivalues (8 bits, 24 bits, 
etc.) for information per pixel. 

[0028] A binarization unit 2 converts the multivalued image data into binary 
image data of a monochrome image. The binarizing process is performed by a 
method in which the brightness of a pixel represented by multi-values is defined as 
1 when it is equal to or larger than a predetermined threshold, and as 0 when it is 
smaller than the threshold. 
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[0029] The monochrome image data converted into binary data is transmitted to 
a labeling process unit 3 which executes a labeling process of grouping a 
continuous black pixel area. 

[0030] The labeling process is as follows. First, as shown in FIG. 2A, a 
sequence of black pixels is determined and grouped the continuous black pixel area 
as one unit, as indicated by a range enclosed by the diagonal lines in FIG. 2A. 
Then, as shown in FIG. 2B, bounding rectangle in a continuous black pixel area is 
extracted for each group to obtain group bounding rectangle information for each 
grouped continuous black pixel area. 

[0031] When a document to be read contains graphics, pictures, etc., it is desired 
to use group bounding rectangles which contain characters only, since the 
rectangles are used for determination of language. Therefore, a row extracting 
process unit 4 is provided to extract a row formed by only characters based on the 
position information of the group bounding rectangle extracted in the labeling 
process. 

[0032] The row extracting process unit 4 determines whether the characters are 
linearly arranged in the X direction for predetermined intervals of rows as shown 
in FIG. 3, or linearly arranged in the Y direction as shown in FIG. 4 based on the 
position information about the group bounding rectangle extracted in the labeling 
process based on the feature that characters can be linearly arranged in the X 
direction as shown in FIG. 3, or the characters are linearly arranged in the Y 
direction as shown in FIG. 4. And, the row extracting process unit 4 extracts the 
row rectangle information by defining the strings of group bounding rectangles 
linearly arranged as rows. 
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[0033] Thus, overlap integrating process unit 5 determines the overlap between 
group bounding rectangles contained in the extracted row rectangle, integrates the 
groups in which the overlap is detected into one group, and extracts one newly 
integrated group bounding rectangle. 

[0034] The overlap integrating process is performed as shown in FIG. 5. That 
is, as shown in FIG. 5, group bounding rectangles are extracted from the three 
characters, whose originally shapes are shown in FIG. 5C, in the labeling process, 
and the overlap is determined according to the position information about the 
group bounding rectangles. These three characters are example images of 
characters in which lines and dots forming the character are separated each other 
such as Japanese. 

[0035] As shown in FIG. 5A, since all black pixels are continuous after labeling 
the first character, only one group bounding rectangle is extracted. On the other 
hand, when the second and third characters are labeled, the second character is 
divided into three groups of the group a, group b and group c. And, among these 
groups the groups a and c overlap each other. And, the third character is divided 
into two groups of the group d and group e which overlaps each other. 
[0036] The overlapping groups are integrated as shown in FIG. 5B. The third 
character is integrated into single group g. However, the integrated group f and 
the other group b further overlap in the second character. 

[0037] The overlap integrating process unit 5 integrates the group obtained by 
the integration with a further group when they overlap each other to form a group 
h as shown in FIG. 5C. Thus, the overlap integrating process unit 5 continues 
integrating groups until no more overlapping group bounding rectangles are 
detected in the row rectangle. 
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[0038] As described above, the number of obtained group bounding rectangles 
after performing the overlap integrating process in the row rectangle is assumed to 
be B, and the number of the group bounding rectangles contained in the row 
rectangle before performing the overlap integrating process is assumed to be A. 
Then, the ratio of X of the number of the group bounding rectangles before the 
overlap integrating process to the number of group bounding rectangles after the 
overlap integrating process can be obtained by the equation X = B / A. The 
language can be determined by the value of the overlap integration ratio X. 
[0039] As shown in FIG. 6A, in the case of a row of English characters, there are 
no overlapping group bounding rectangles to be integrated in almost all group 
bounding rectangles, and the overlap integration ratio X is close to 1.0 according to 
the statistical data. In FIG. 6A, the number A is 21, and the number B is 21, 
then the ratio X is 1.0. 

[0040] On the other hand, as shown in FIG. 6B, in the case of a row of Japanese 
characters, there are a number of group bounding rectangles to be treated in the 
overlap integrating process, and the overlap integration ratio of X is 0.53 which is 
much smaller than the value for the English characters. In FIG. 6B, the number 
A is 17, and the number B is 9, then the ratio X is 0.53. 

[0041] Then, the upper limit of the range of the overlap integration ratio for 
Japanese is statistically calculated to use the value as threshold. A language 
determining process unit 6 determines that the language is English when the 
value is equal to or larger than the threshold, and the language is Japanese when 
it is smaller than the threshold, thereby correctly determining the language of the 
character contained in the row rectangle. 
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[0042] When the determination is made on a single row, wrong determination 
may be made depending on the contents of the characters in the row. Therefore, 
statistical determination process unit 7 may further be provided. The statistical 
determination process unit 7 performs the language determining process on a 
plurality of rows contained in an original, and also a statistical process so that the 
language determined to be the language of the characters contained in the most 
rows can be determined as the language of the characters in the original. Thus, 
the final wrong determination of the language of the character in the original can 
be suppressed by correcting the wrong determination of the language of single row 
depending on the characters in the row, and the language can be automatically 
determined more correctly. 

[0043] When a document to be read contains only characters, the labeling 
process, the overlap integrating process, and the language determining process can 
be performed on the entire original. 

[0044] According to the present invention, the following effects can be expected. 
[0045] When characters are recognized using an OCR, etc. in the image reading 
apparatus for reading a document image containing character information, it is 
necessary to recognize the characters using the OCR engine for the language of the 
characters to be recognized. Therefore, the user of the apparatus manually sets 
the language of the original before reading the image, and the apparatus is not 
easily operable. To solve the above-mentioned problem, an image reading 
apparatus has been suggested with a plurality of OCR engines loaded on the 
apparatus for actually recognizing characters using the plurality of OCR engines, 
and automatically determining the language of the highest probability of correct 
recognition as the language of the characters contained in an original. However, 
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in the above-mentioned apparatus, it takes a long time to perform the recognition 
using the OCR, and perform the preprocess for a read of an image which is 
performed for automatically determining a language, thereby causing a delay in 
reading an image. To shorten the processing time, it is preferable to realize the 
function by hardware, but the character recognizing capability of the OCR for a 
plurality of languages, etc. cannot be realized by hardware. 
[0046] According to the present invention, the language of the characters 
contained in an original can be easily and automatically determined. 
Furthermore, since the function is not complicated, it can be realized by hardware, 
and the process can be performed at a higher speed. 
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